Using Articulatory Knowledge in Automatic Speech Recognition
نویسنده
چکیده
Over the years different types of speech recognizers have been proposed and tested. During the last decade (or maybe even longer) hidden Markov models (HMMs) seem to have a better performance than other types of speech recognizers, like e.g. rule-based speech recognizers. This state of affairs has led to a gap between speech technology on the one hand, and phonetics and phonology on the other. Clearly, this is not an ideal situation, because both fields could and should benefit from each other. The important question then is: why is the performance of rule-based recognizers not as good as that of HMM-based recognizers? Probably this is due to a combination of several factors like: (1) at the moment, there isn’t enough knowledge available; (2) most of the knowledge available is derived from lab speech and cannot always be generalized to ther types of speech; (3) the kn owledge available is generally transformed into deterministic rules in rule-based systems, while HMM-based systems mainly use a kind of stochastic rules; (4) and finally, and probably most important, in many rule-based recognizers local decisions often have to be made (i.e. whether a segment is voiced or not), while in HMM-based systems one overall probabilistic decision is made.
منابع مشابه
Knowledge from Speech Production Used in Speech Technology: Articulatory Synthesis*
There appears to be a continuing trend toward incorporating knowledge of speech production into s~eech technology-text-to-speech synthesis (e.g., BIckley, Stevens, & Williams, 1994; Parthasarthy & Coker, 1992), low bit rate coding (see Schroeter & Sondhi, 1992), and automatic speech recognition (e.g., Rose, Schroeter, & Sondhi, 1994; Shirai & Kobayashi, 1986). For automatic speech recognition, ...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملA Multi-channel/multi-speaker Articulatory Database for Continuous Speech Recognition Research
The goal of this research is to improve the performance of a speaker-independent Automatic Speech Recognition (ASR) system by using directly measured articulatory parameters in the training phase. This paper examines the need for a multi-channel/multi-speaker articulatory database and describes the design of such a database and the processes involved in its creation.
متن کاملArticulatory feature based continuous speech recognition using probabilistic lexical modeling
Phonological studies suggest that the typical subword units such as phones or phonemes used in automatic speech recognition systems can be decomposed into a set of features based on the articulators used to produce the sound. Most of the current approaches to integrate articulatory feature (AF) representations into an automatic speech recognition (ASR) system are based on deterministic knowledg...
متن کاملLearning mixed acoustic/articulatory models for disabled speech
This paper argues that automatic speech recognition (ASR) should accommodate dysarthric speech by incorporating knowledge of the production characteristics of these speakers. We describe the acquisition of a new database of dysarthric speech that includes aligned acoustics and articulatory data obtained by electromagnetic articulography. This database is used to train theoretical and empirical ...
متن کاملAutomatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion.
An automatic speech recognition approach is presented which uses articulatory features estimated by a subject-independent acoustic-to-articulatory inversion. The inversion allows estimation of articulatory features from any talker's speech acoustics using only an exemplary subject's articulatory-to-acoustic map. Results are reported on a broad class phonetic classification experiment on speech ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1995